NOVEL HUMAN PROTEASES AND POLYNUCLEOTIDES 
ENCODING THE SAME 



The present application claims the benefit of U.S.
Provisional Application Number 60/174,686, which was filed on
January 6, 2000 and is herein incorporated by reference in its
entirety.

1. INTRODUCTION
The present invention relates to the discovery,
identification, and characterization of novel human
polynucleotides encoding proteins sharing sequence similarity with
mammalian proteases. The invention encompasses the described
polynucleotides, host cell expression systems, the encoded
proteins, fusion proteins, polypeptides and peptides, antibodies
to the encoded proteins and peptides, and genetically engineered
animals that either lack or over express the disclosed sequences,
antagonists and agonists of the proteins, and other compounds that
modulate the expression or activity of the proteins encoded by the
disclosed polynucleotides that can be used for diagnosis, drug
screening, clinical trial monitoring and the treatment of
physiological disorders.

2. BACKGROUND OF THE INVENTION
Proteases cleave protein substrates as part of degradation,
maturation, and secretory pathways within the body. Proteases
[...] regulating [...], infertility, modulating cellular
processes, fertility, and infectious disease.

3. SUMMARY OF THE INVENTION



[...] share structural similarity with animal
proteases, and particularly trypsin-like serine proteases such as
enteropeptidase (enterokinase), plasminogen, and acrosin.

The novel human nucleic acid (cDNA) sequences described
herein, encode proteins/open reading frames (ORFs) of 217, 348,
and 238 amino acids in length (see SEQ ID NOS: 2, 4, and 6
respectively).

The invention also encompasses agonists and antagonists of
the described NHPs, including small molecules, large molecules,
mutant NHPs, or portions thereof that compete with native NHPs,
NHP peptides, and NHP antibodies, as well as nucleotide sequences
that can be used to inhibit the expression of the described NHPs
(e.g., antisense and ribozyme molecules, and gene or regulatory
sequence replacement constructs) or to enhance the expression of
the described NHPs (e.g., expression constructs that place the
described sequence under the control of a strong promoter system),
and transgenic animals that express a NHP transgene, or
"knock-outs" (which can be conditional) that do not express a
functional NHP.

Further, the present invention also relates to processes for
identifying compounds that modulate, i.e., act as agonists or
antagonists, of NHP expression and/or NHP activity that utilize
purified preparations of the described NHP and/or NHP product, or
cells expressing the same. Such compounds can be used as
therapeutic agents for the treatment of any of a wide variety of
symptoms associated with biological disorders or imbalances.

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The Sequence Listing provides the sequences of the NHP ORFs



[...] described for the first time herein, are novel
proteins that are expressed in, inter alia, human cell lines, and
human testis cells.

The described sequences were compiled from gene trapped cDNAs
and clones isolated from a human testis cDNA library (Edge
Biosystems, Gaithersburg, MD). The present invention encompasses
the nucleotides presented in the Sequence Listing, host cells
expressing such nucleotides, the expression products of such
nucleotides, and: (a) nucleotides that encode mammalian homologs
of the described sequences, including the specifically described
NHPs, and the NHP products; (b) nucleotides that encode one or
more portions of a NHP that correspond to functional domains of
the NHP, and the polypeptide products specified by such nucleotide
sequences, including but not limited to the novel regions of any
active domain(s); (c) isolated nucleotides that encode mutant
versions, engineered or naturally occurring, of a described NHP in
which all or a part of at least one domain is deleted or altered,
and the polypeptide products specified by such nucleotide
sequences, including but not limited to soluble proteins and
peptides in which all or a portion of the signal sequence is
deleted; (d) nucleotides that encode chimeric fusion proteins
containing all or a portion of a coding region of a NHP, or one of
its domains (e.g., a receptor or ligand binding domain, accessory
[...] polynucleotides, ribozymes, dsRNA, or gene therapy constructs
comprising a sequence first disclosed in the Sequence Listing.

As discussed above, the present invention includes: (a) the



[...] hybridizes to a complement of a DNA sequence
presented in the Sequence Listing under highly stringent
conditions, e.g., hybridization to filter-bound DNA in 0.5 M
NaHPO4, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and
washing in 0.1xSSC/0.1% SDS at 68°C (Ausubel F.M. et al., eds.,
1989, Current Protocols in Molecular Biology, Vol. 1, Green
Publishing Associates, Inc., and John Wiley & Sons, Inc., New
York, at p. 2.10.3) and encodes a functionally equivalent gene
product. Additionally contemplated are any nucleotide sequences
that hybridize to the complement of the DNA sequence that encode
and express an amino acid sequence presented in the Sequence
Listing under moderately stringent conditions, e.g., washing in
0.2xSSC/0.1% SDS at 42°C (Ausubel et al., 1989, supra), yet still
encode a functionally equivalent NHP product. Functional
equivalents of a NHP include naturally occurring NHPs present in
other species and mutant NHPs whether naturally occurring or
engineered (by site directed mutagenesis, gene shuffling, directed
evolution as described in, for example, U.S. Patent No.
5,837,458). The invention also includes degenerate nucleic acid
variants of the disclosed NHP polynucleotide sequences.

Additionally contemplated are polynucleotides encoding a NHP
ORF, or its functional equivalent, encoded by a polynucleotide
sequence that is about 99, 95, 90, or about 85 percent similar or
identical to corresponding regions of the nucleotide sequences of
the Sequence Listing (as measured by BLAST sequence comparison
analysis using, for example, the GCG sequence analysis package
using standard default settings).
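
As a rough illustration of the percent-identity thresholds recited
above, the following minimal sketch scores two sequences that have
already been aligned to equal length. It is not the BLAST or GCG
algorithm; the function name and the two 20-base fragments are
hypothetical and are not taken from the Sequence Listing.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding identical residues.

    Assumes the inputs are pre-aligned (same length, '-' for gaps).
    Columns containing a gap are counted as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical aligned fragments that differ at three of 20 positions.
reference = "ATGGCCTTAGGCTTACCGGA"
candidate = "ATGGCTTTAGGCTAACCGGT"
identity = percent_identity(reference, candidate)
```

Under these assumptions, a candidate would be compared against the
chosen threshold (for example, `identity >= 85.0`).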

The invention also includes nucleic acid molecules,
preferably DNA molecules, that hybridize to, and are therefore the



[...] molecules are generally about 16 to about 100 bases long, or
about 20 to about 80, or about 34 to about 45 bases long, or any
variation or combination of sizes represented therein that
incorporate a contiguous region of sequence first disclosed in the
Sequence Listing. Such oligonucleotides can be used in
conjunction with the polymerase chain reaction (PCR) to screen
libraries, isolate clones, and prepare cloning and sequencing
templates, etc.

Alternatively, such NHP oligonucleotides can be used as
hybridization probes for screening libraries, and assessing gene
expression patterns (particularly using a microarray or
high-throughput "chip" format). Additionally, a series of the
described NHP oligonucleotide sequences, or the complements
thereof, can be used to represent all or a portion of the
described NHP sequences. An oligonucleotide or polynucleotide
sequence first disclosed in at least a portion of one or more of
the sequences of SEQ ID NOS: 1-7 can be used as a hybridization
probe in conjunction with a solid support matrix/substrate
(resins, beads, membranes, plastics, polymers, metal or metallized
substrates, crystalline or polycrystalline substrates, etc.). Of
particular note are spatially addressable arrays (i.e., gene
chips, microtiter plates, etc.) of oligonucleotides and
polynucleotides, or corresponding oligopeptides and polypeptides,
wherein at least one of the biopolymers present on the spatially
addressable array comprises an oligonucleotide or polynucleotide
sequence first disclosed in at least one of the sequences of SEQ
ID NOS: 1-7, or an amino acid sequence encoded thereby. Methods
for attaching biopolymers to, or synthesizing biopolymers on,
solid support matrices, and conducting binding studies thereon are



[...] addressable arrays comprising sequences first disclosed in
SEQ ID NOS:1-7 can be used to identify and characterize the
temporal and tissue specific expression of a sequence. These
addressable arrays incorporate oligonucleotide sequences of
sufficient length to confer the required specificity, yet be
within the limitations of the production technology. The length
of these probes is within a range of between about 8 to about 2000
nucleotides. Preferably the probes consist of 60 nucleotides and
more preferably 25 nucleotides from the sequences first disclosed
in SEQ ID NOS:1-7.

For example, a series of the described oligonucleotide
sequences, or the complements thereof, can be used in chip format
to represent all or a portion of the described sequences. The
oligonucleotides, typically between about 16 to about 40 (or any
whole number within the stated range) nucleotides in length can
partially overlap each other and/or the sequence may be
represented using oligonucleotides that do not overlap.
Accordingly, the described polynucleotide sequences shall
typically comprise at least about two or three distinct
oligonucleotide sequences of at least about 8 nucleotides in
length that are each first disclosed in the described Sequence
Listing. Such oligonucleotide sequences can begin at any
nucleotide present within a sequence in the Sequence Listing and
proceed in either a sense (5'-to-3') orientation vis-a-vis the
described sequence or in an antisense orientation.
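
The tiling scheme described above (oligonucleotides that overlap or
do not overlap, in sense or antisense orientation) can be sketched
as follows. This is an illustration only: the function names, the
window and step sizes, and the 40-base sequence are hypothetical
and are not taken from the Sequence Listing.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the antisense (reverse-complement) strand, written 5'-to-3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tile_oligos(seq: str, length: int, step: int, antisense: bool = False) -> list[str]:
    """Cut `seq` into oligos of `length` bases, starting every `step` bases.

    step < length gives partially overlapping oligos;
    step >= length gives non-overlapping oligos.
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    seq = seq.upper()
    oligos = [seq[i:i + length] for i in range(0, len(seq) - length + 1, step)]
    if antisense:
        oligos = [reverse_complement(oligo) for oligo in oligos]
    return oligos


# Hypothetical 40-base sequence used only for illustration.
example = "ATGGCCTTAGGCTTACCGGAATTCGGATCCAAGCTTGCAT"
overlapping = tile_oligos(example, length=16, step=8)
non_overlapping = tile_oligos(example, length=16, step=16)
antisense_set = tile_oligos(example, length=16, step=16, antisense=True)
```

With a step of 8, each 16-base oligonucleotide shares its last 8
bases with the first 8 bases of the next; with a step of 16 the
oligonucleotides abut without overlap.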

Microarray-based analysis allows the discovery of broad
patterns of genetic activity, providing new understanding of gene
functions and generating novel and unexpected insight into
transcriptional processes and biological mechanisms. The use of



[...] novel components or gene functions that manifest
themselves as novel phenotypes.

Probes consisting of sequences first disclosed in SEQ ID
NOS:1-7 can also be used in the identification, selection and
validation of novel molecular targets for drug discovery. The use
of these unique sequences permits the direct confirmation of drug
targets and recognition of drug dependent changes in gene
expression that are modulated through pathways distinct from the
drug's intended target. These unique sequences therefore also have
utility in defining and monitoring both drug action and toxicity.

As an example of utility, the sequences first disclosed in
SEQ ID NOS:1-7 can be utilized in microarrays or other assay
formats, to screen collections of genetic material from patients
who have a particular medical condition. These investigations can
also be carried out using the sequences first disclosed in SEQ ID
NOS:1-7 in silico and by comparing previously collected genetic
databases and the disclosed sequences using computer software
known to those in the art.

Thus the sequences first disclosed in SEQ ID NOS:1-7 can be
used to identify mutations associated with a particular disease
and also as a diagnostic or prognostic assay.

Although the presently described sequences have been
specifically described using nucleotide sequence, it should be
appreciated that each of the sequences can uniquely be described
using any of a wide variety of additional structural attributes,
or combinations thereof. For example, a given sequence can be
described by the net composition of the nucleotides present within
a given region of the sequence in conjunction with the presence of
one or more specific oligonucleotide sequence(s) first disclosed



[...] given sequence. Such restriction maps, which are typically
generated by widely available computer programs (e.g., the
University of Wisconsin GCG sequence analysis package, SEQUENCHER
3.0, Gene Codes Corp., Ann Arbor, MI, etc.), can optionally be
used in conjunction with one or more discrete nucleotide
sequence(s) present in the sequence that can be described by the
relative position of the sequence relative to one or more
additional sequence(s) or one or more restriction sites present in
the disclosed sequence.
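
A restriction map of the kind mentioned above can be approximated
with a simple string search. The sketch below is not the GCG or
SEQUENCHER implementation; the three enzymes are chosen only as
familiar examples, and the 40-base sequence is hypothetical rather
than taken from the Sequence Listing.

```python
# Recognition sequences of three commonly used restriction enzymes.
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
}


def restriction_map(seq: str, enzymes: dict[str, str] = RECOGNITION_SITES) -> dict[str, list[int]]:
    """Map each enzyme name to the 1-based start positions of its site in `seq`."""
    seq = seq.upper()
    site_positions: dict[str, list[int]] = {}
    for name, site in enzymes.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start + 1)        # report 1-based coordinates
            start = seq.find(site, start + 1)  # allow overlapping matches
        site_positions[name] = positions
    return site_positions


# Hypothetical 40-base sequence used only for illustration.
example = "ATGGCCTTAGGCTTACCGGAATTCGGATCCAAGCTTGCAT"
sites = restriction_map(example)
```

The relative spacing of the reported positions is what allows a
sequence to be described by its restriction sites in combination
with other attributes.
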

For oligonucleotide probes, highly stringent conditions may
refer, e.g., to washing in 6xSSC/0.05% sodium pyrophosphate at
37°C (for 14-base oligos), 48°C (for 17-base oligos), 55°C (for
20-base oligos), and 60°C (for 23-base oligos). These nucleic
acid molecules may encode or act as NHP gene antisense molecules,
useful, for example, in NHP gene regulation (for and/or as
antisense primers in amplification reactions of NHP nucleic acid
sequences). With respect to NHP gene regulation, such techniques
can be used to regulate biological functions. Further, such
sequences may be used as part of ribozyme and/or triple helix
sequences that are also useful for NHP gene regulation.
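
The length-dependent wash temperatures quoted above for
oligonucleotide probes can be captured as a simple lookup. The
sketch below only restates the four values given in the text; the
names are hypothetical, and no interpolation is attempted for
lengths the text does not list.

```python
# Wash temperature (degrees C) by oligonucleotide length (bases),
# restating the four examples given in the text.
WASH_TEMPERATURE_C = {14: 37, 17: 48, 20: 55, 23: 60}


def wash_temperature_c(oligo_length: int) -> int:
    """Return the quoted wash temperature for a listed oligo length."""
    try:
        return WASH_TEMPERATURE_C[oligo_length]
    except KeyError:
        raise ValueError(
            f"no wash temperature is quoted for {oligo_length}-base oligos"
        ) from None


temperature_for_20mer = wash_temperature_c(20)
```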

Inhibitory antisense or double stranded oligonucleotides can
additionally comprise at least one modified base moiety which is
selected from the group including but not limited to
5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil,
hypoxanthine, xantine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl)
uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-
galactosylqueosine, inosine, N6-isopentenyladenine,
1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,



[...] 2-methylthio-N6-isopentenyladenine, uracil-5-
oxyacetic acid (v), wybutoxosine, pseudouracil, queosine,
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil,
5-methyluracil, uracil-5-oxyacetic acid methyl ester, uracil-
5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-
carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine.

The antisense oligonucleotide can also comprise at least one
modified sugar moiety selected from the group including but not
limited to arabinose, 2-fluoroarabinose, xylulose, and hexose.

In yet another embodiment, the antisense oligonucleotide will
comprise at least one modified phosphate backbone selected from
the group consisting of a phosphorothioate, a phosphorodithioate,
a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a
methylphosphonate, an alkyl phosphotriester, and a formacetal or
analog thereof.

In yet another embodiment, the antisense oligonucleotide is
an α-anomeric oligonucleotide. An α-anomeric oligonucleotide
forms specific double-stranded hybrids with complementary RNA in
which, contrary to the usual β-units, the strands run parallel to
each other (Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641).
The oligonucleotide is a 2'-O-methylribonucleotide (Inoue et al.,
1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA
analogue (Inoue et al., 1987, FEBS Lett. 215:327-330).
Alternatively, double stranded RNA can be used to disrupt the
expression and function of a targeted NHP.

Oligonucleotides of the invention can be synthesized by
standard methods known in the art, e.g. by use of an automated DNA
synthesizer (such as are commercially available from Biosearch,
Applied Biosystems, etc.). As examples, phosphorothioate



[...] (Sarin et al., 1988, Proc. Natl. Acad. Sci.
U.S.A. 85:7448-7451), etc.

Low stringency conditions are well known to those of skill in
the art, and will vary predictably depending on the specific
organisms from which the library and the labeled sequences are
derived. For guidance regarding such conditions see, for example,
Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual (and
periodic updates thereof), Cold Spring Harbor Press, N.Y.; and
Ausubel et al., 1989, Current Protocols in Molecular Biology,
Green Publishing Associates and Wiley Interscience, N.Y.

Alternatively, suitably labeled NHP nucleotide probes can be
used to screen a human genomic library using appropriately
stringent conditions or by PCR. The identification and
characterization of human genomic clones is helpful for
identifying polymorphisms (including, but not limited to,
nucleotide repeats, microsatellite alleles, single nucleotide
polymorphisms, or coding single nucleotide polymorphisms),
determining the genomic structure of a given locus/allele, and
designing diagnostic tests. For example, sequences derived from
regions adjacent to the intron/exon boundaries of the human gene
can be used to design primers for use in amplification assays to
detect mutations within the exons, introns, splice sites (e.g.,
splice acceptor and/or donor sites), etc., that can be used in
diagnostics and pharmacogenomics.

Further, a NHP homolog can be isolated from nucleic acid from
an organism of interest by performing PCR using two degenerate or
"wobble" oligonucleotide primer pools designed on the basis of
amino acid sequences within the NHP products disclosed herein.
The template for the reaction may be total RNA, mRNA, and/or cDNA



[...] can be subcloned and sequenced to ensure that
the amplified sequences represent the sequence of the desired NHP
gene. The PCR fragment can then be used to isolate a full length
cDNA clone by a variety of methods. For example, the amplified
fragment can be labeled and used to screen a cDNA library, such as
a bacteriophage cDNA library. Alternatively, the labeled fragment
can be used to isolate genomic clones via the screening of a
genomic library.

PCR technology can also be used to isolate full length cDNA
sequences. For example, RNA can be isolated, following standard
procedures, from an appropriate cellular or tissue source (i.e.,
one known, or suspected, to express a NHP sequence). A reverse
transcription (RT) reaction can be performed on the RNA using an
oligonucleotide primer specific for the most 5' end of the
amplified fragment for the priming of first strand synthesis. The
resulting RNA/DNA hybrid may then be "tailed" using a standard
terminal transferase reaction, the hybrid may be digested with
RNase H, and second strand synthesis may then be primed with a
complementary primer. Thus, cDNA sequences upstream of the
amplified fragment can be isolated. For a review of cloning
strategies that can be used, see e.g., Sambrook et al., 1989,
supra.

A cDNA encoding a mutant NHP gene can be isolated, for
example, by using PCR. In this case, the first cDNA strand may be
synthesized by hybridizing an oligo-dT oligonucleotide to mRNA
isolated from tissue known or suspected to be expressed in an
individual putatively carrying a mutant NHP allele, and by
extending the new strand with reverse transcriptase. The second
strand of the cDNA is then synthesized using an oligonucleotide



[...] sequence of the mutant NHP allele
to that of a corresponding normal NHP allele, the mutation(s)
responsible for the loss or alteration of function of the mutant
NHP gene product can be ascertained.

Alternatively, a genomic library can be constructed using DNA
obtained from an individual suspected of or known to carry a
mutant NHP allele (e.g., a person manifesting a NHP-associated
phenotype such as, for example, obesity, high blood pressure,
connective tissue disorders, infertility, etc.), or a cDNA library
can be constructed using RNA from a tissue known, or suspected, to
express a mutant NHP allele. A normal NHP gene, or any suitable
fragment thereof, can then be labeled and used as a probe to
identify the corresponding mutant NHP allele in such libraries.
Clones containing mutant NHP gene sequences can then be purified
and subjected to sequence analysis according to methods well known
to those skilled in the art.

Additionally, an expression library can be constructed
utilizing cDNA synthesized from, for example, RNA isolated from a
tissue known, or suspected, to express a mutant NHP allele in an
individual suspected of or known to carry such a mutant allele.
In this manner, gene products made by the putatively mutant tissue
can be expressed and screened using standard antibody screening
techniques in conjunction with antibodies raised against normal
NHP product, as described below. (For screening techniques, see,
for example, Harlow, E. and Lane, eds., 1988, "Antibodies: A
Laboratory Manual", Cold Spring Harbor Press, Cold Spring Harbor.)

Additionally, screening can be accomplished by screening with
labeled NHP fusion proteins, such as, for example, alkaline
phosphatase-NHP or NHP-alkaline phosphatase fusion proteins.
In cases where a [...]



[...] their reaction with such labeled antibodies
can be purified and subjected to sequence analysis according to
methods well known in the art.

The invention also encompasses (a) DNA vectors that contain
any of the foregoing NHP coding sequences and/or their complements
(i.e., antisense); (b) DNA expression vectors that contain any of
the foregoing NHP coding sequences operatively associated with a
regulatory element that directs the expression of the coding
sequences (for example, baculovirus as described in U.S. Patent
No. 5,869,336 herein incorporated by reference); (c) genetically
engineered host cells that contain any of the foregoing NHP coding
sequences operatively associated with a regulatory element that
directs the expression of the coding sequences in the host cell;
and (d) genetically engineered host cells that express an
endogenous NHP sequence under the control of an exogenously
introduced regulatory element (i.e., gene activation). As used
herein, regulatory elements include, but are not limited to,
inducible and non-inducible promoters, enhancers, operators and
other elements known to those skilled in the art that drive and
regulate expression. Such regulatory elements include but are not
limited to the human cytomegalovirus (hCMV) immediate early gene,
regulatable, viral elements (particularly retroviral LTR
promoters), the early or late promoters of SV40 adenovirus, the
lac system, the trp system, the TAC system, the TRC system, the
major operator and promoter regions of phage lambda, the control
regions of fd coat protein, the promoter for 3-phosphoglycerate
kinase (PGK), the promoters of acid phosphatase, and the promoters
of the yeast α-mating factors.

The present invention also encompasses antibodies and anti- 



[...] regulatory sequence replacement constructs), or promote the
expression of a NHP (e.g., expression constructs in which NHP
coding sequences are operatively associated with expression
control elements such as promoters, promoter/enhancers, etc.).

The NHPs or NHP peptides, NHP fusion proteins, NHP nucleotide
sequences, antibodies, antagonists and agonists can be useful for
the detection of mutant NHPs or inappropriately expressed NHPs for
the diagnosis of disease. The NHP proteins or peptides, NHP
fusion proteins, NHP nucleotide sequences, host cell expression
systems, antibodies, antagonists, agonists and genetically
engineered cells and animals can be used for screening for drugs
(or high throughput screening of combinatorial libraries)
effective in the treatment of the symptomatic or phenotypic
manifestations of perturbing the normal function of a NHP in the
body. The use of engineered host cells and/or animals may offer
an advantage in that such systems allow not only for the
identification of compounds that bind to the endogenous receptor
for a NHP, but can also identify compounds that trigger
NHP-mediated activities or pathways.

Finally, the NHP products can be used as therapeutics. For
example, soluble derivatives such as NHP peptides/domains
corresponding to NHP, NHP fusion protein products (especially
NHP-Ig fusion proteins, i.e., fusions of a NHP, or a domain of a
NHP, to an IgFc), NHP antibodies and anti-idiotypic antibodies
(including Fab fragments), antagonists or agonists (including
compounds that modulate or act on downstream targets in a NHP-
mediated pathway) can be used to directly treat diseases or
disorders. For instance, the administration of an effective
amount of soluble NHP, or a NHP-IgFc fusion protein or an anti-



genetically engineered cells function as "bioreactors" in the body
delivering a continuous supply of a NHP, a NHP peptide, or a NHP
fusion protein to the body. Nucleotide constructs encoding
functional NHP, mutant NHPs, as well as antisense and ribozyme
molecules can also be used in "gene therapy" approaches for the
modulation of NHP expression. Thus, the invention also
encompasses pharmaceutical formulations and methods for treating
biological disorders.

Various aspects of the invention are described in greater
detail in the subsections below.

5.1 THE NHP SEQUENCES 
The cDNA sequences (SEQ ID NO: 1, 3, and 5) and the
corresponding deduced amino acid sequences of the described NHP
are presented in the Sequence Listing. SEQ ID NO:7 describes a
NHP ORF as well as flanking regions. The NHP nucleotides were
obtained from human cDNA libraries using probes and/or primers
generated from human gene trapped sequence tags. Expression
analysis has provided evidence that the described NHPs can be
expressed in human tissue as well as gene trapped human cells. In
addition, the described NHP sequences can contain a variety of
polymorphisms such as at nucleotide 23 of SEQ ID NO:3 and
nucleotide 55 of SEQ ID NO:3 which both can be a C or a T and can
give rise to a silent mutation at corresponding amino acid position
[...] of SEQ ID NO:4 or a tyr or his at amino acid position 19 of
SEQ ID NO:4. The described NHP sequences can also contain G-A
polymorphisms at nucleotide 379 of SEQ ID NO:3 and nucleotide
position 199 of SEQ ID NO:5 which can give rise to a corresponding
ala or thr at amino acid position 127 of SEQ ID NO:4, or residue
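
The correspondence between the nucleotide and amino acid positions
quoted above can be checked with simple codon arithmetic. The
sketch below assumes, as an illustration only, that the open
reading frame begins at nucleotide 1 of the cDNA sequence; the
function name is hypothetical.

```python
def codon_index(nucleotide_position: int) -> tuple[int, int]:
    """Map a 1-based nucleotide position to (amino acid position, codon offset).

    Both returned values are 1-based. Assumes the ORF starts at
    nucleotide 1 of the sequence, which is an assumption made here.
    """
    if nucleotide_position < 1:
        raise ValueError("nucleotide positions are 1-based")
    zero_based = nucleotide_position - 1
    return zero_based // 3 + 1, zero_based % 3 + 1


# Positions quoted in the text for SEQ ID NO:3.
residue_for_nt_55, offset_for_nt_55 = codon_index(55)
residue_for_nt_379, offset_for_nt_379 = codon_index(379)
```

Under that assumption, nucleotide 55 falls in the first position of
codon 19 and nucleotide 379 in the first position of codon 127,
which agrees with the amino acid positions stated for SEQ ID NO:4.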



[...] NHPs, polypeptides, peptide fragments, mutated, truncated, or
deleted forms of the NHPs, and/or NHP fusion proteins can be
prepared for a variety of uses. These uses include, but are not
limited to, the generation of antibodies, as reagents in
diagnostic assays, for the identification of other cellular gene
products related to a NHP, as reagents in assays for screening for
compounds that can be used as pharmaceutical reagents useful in the
therapeutic treatment of mental, biological, or medical disorders
and disease.

The Sequence Listing discloses the amino acid sequence
encoded by the described NHP polynucleotides. The NHPs display
initiator methionines in DNA sequence contexts consistent with a
translation initiation site, and display a consensus signal
sequence.

The NHP amino acid sequences of the invention include the
amino acid sequences presented in the Sequence Listing as well as
analogues and derivatives thereof, as well as any oligopeptide
sequence of at least about 10-40, generally about 12-35, or about
16-30 amino acids in length first disclosed in the Sequence
Listing. Further, corresponding NHP homologues from other species
are encompassed by the invention. In fact, any NHP encoded by the
NHP nucleotide sequences described above are within the scope of
the invention, as are any novel polynucleotide sequences encoding
all or any novel portion of an amino acid sequence presented in
the Sequence Listing. The degenerate nature of the genetic code
is well known, and, accordingly, each amino acid presented in the
Sequence Listing, is generically representative of the well known
nucleic acid "triplet" codon, or in many cases codons, that can
encode the amino acid. As such, as contemplated herein, the amino



[...] are generically representative of all the various
permutations and combinations of nucleic acid sequences that can
encode such amino acid sequences.
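
The number of distinct nucleic acid sequences that can encode a
given peptide follows directly from the degeneracy of the standard
genetic code. The sketch below multiplies the number of codons
available for each residue; the function name and the four-residue
peptide are hypothetical and are not taken from the Sequence
Listing.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (one-letter amino acid codes; stop codons are excluded).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2,
    "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4,
    "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def encoding_count(peptide: str) -> int:
    """Count the coding sequences that translate to `peptide`."""
    try:
        return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())
    except KeyError as unknown:
        raise ValueError(f"unknown amino acid code: {unknown}") from None


# Hypothetical peptide Met-Ala-Trp-Lys: 1 * 4 * 1 * 2 possibilities.
count = encoding_count("MAWK")
```

Because the count grows multiplicatively with peptide length, a
full-length ORF is encoded by a very large family of degenerate
nucleic acid variants.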

The invention also encompasses proteins that are functionally
equivalent to the NHPs encoded by the presently described
nucleotide sequences as judged by any of a number of criteria,
including, but not limited to, the ability to bind and cleave a
substrate of a NHP, or the ability to effect an identical or
complementary downstream pathway, or a change in cellular
metabolism (e.g., proteolytic activity, ion flux, tyrosine
phosphorylation, etc.). Such functionally equivalent NHP proteins
include, but are not limited to, additions or substitutions of
amino acid residues within the amino acid sequence encoded by the
NHP nucleotide sequences described above, but which result in a
silent change, thus producing a functionally equivalent gene
product. Amino acid substitutions can be made on the basis of
similarity in polarity, charge, solubility, hydrophobicity,
hydrophilicity, and/or the amphipathic nature of the residues
involved. For example, nonpolar (hydrophobic) amino acids include
alanine, leucine, isoleucine, valine, proline, phenylalanine,
tryptophan, and methionine; polar neutral amino acids include
glycine, serine, threonine, cysteine, tyrosine, asparagine, and
glutamine; positively charged (basic) amino acids include
arginine, lysine, and histidine; and negatively charged (acidic)
amino acids include aspartic acid and glutamic acid.
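
The four amino acid groupings listed above can be encoded as a
lookup and used as a crude first-pass check on whether a
substitution stays within one class. This is only a sketch: the
names are hypothetical, and class membership alone does not
establish that a change is silent or that the product is
functionally equivalent.

```python
# One-letter codes for the four classes named in the text.
AMINO_ACID_CLASSES = {
    "nonpolar": set("ALIVPFWM"),
    "polar_neutral": set("GSTCYNQ"),
    "basic": set("RKH"),
    "acidic": set("DE"),
}


def residue_class(residue: str) -> str:
    """Return the class name for a one-letter amino acid code."""
    residue = residue.upper()
    for name, members in AMINO_ACID_CLASSES.items():
        if residue in members:
            return name
    raise ValueError(f"unknown amino acid code: {residue!r}")


def is_same_class(original: str, replacement: str) -> bool:
    """True when both residues fall in the same class."""
    return residue_class(original) == residue_class(replacement)


leu_to_ile = is_same_class("L", "I")  # both nonpolar
lys_to_asp = is_same_class("K", "D")  # basic versus acidic
```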

A variety of host-expression vector systems can be used to
express the NHP nucleotide sequences of the invention. Where, as
in the present instance, the NHP products or NHP polypeptides are
thought to be soluble or secreted molecules, the peptide or



[...] using appropriate detergents and lipid micelles and
methods well known to those skilled in the art. However, such
engineered host cells themselves may be used in situations where
it is important not only to retain the structural and functional
characteristics of the NHP, but to assess biological activity,
e.g., in drug screening assays.

The expression systems that may be used for purposes of the
invention include but are not limited to microorganisms such as
bacteria (e.g., E. coli, B. subtilis) transformed with recombinant
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors
containing NHP nucleotide sequences; yeast (e.g., Saccharomyces,
Pichia) transformed with recombinant yeast expression vectors
containing NHP encoding nucleotide sequences; insect cell systems
infected with recombinant virus expression vectors (e.g.,
baculovirus) containing NHP sequences; plant cell systems infected
with recombinant virus expression vectors (e.g., cauliflower
mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with
recombinant plasmid expression vectors (e.g., Ti plasmid)
containing NHP nucleotide sequences; or mammalian cell systems
(e.g., COS, CHO, BHK, 293, 3T3) harboring recombinant expression
constructs containing promoters derived from the genome of
mammalian cells (e.g., metallothionein promoter) or from mammalian
viruses (e.g., the adenovirus late promoter; the vaccinia virus
7.5K promoter).

In bacterial systems, a number of expression vectors may be
advantageously selected depending upon the use intended for the
NHP product being expressed. For example, when a large quantity
of such a protein is to be produced for the generation of
pharmaceutical compositions of or containing NHP,






2:1791), in which a NHP coding sequence may be ligated
individually into the vector in frame with [...]
so that a fusion protein is produced; pIN vectors (Inouye &
Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke &
Schuster, 1989, J. Biol. Chem. 264:5503-5509); and the like. pGEX
vectors (Pharmacia or American Type Culture Collection) can also
be used to express foreign polypeptides as fusion proteins with
glutathione S-transferase (GST). In general, such fusion proteins
are soluble and can easily be purified from lysed cells by
adsorption to glutathione-agarose beads followed by elution in the
presence of free glutathione. The pGEX vectors are designed to
include thrombin or factor Xa protease cleavage sites so that the
cloned target sequence product can be released from the GST
moiety.
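The release of a cloned product from the GST moiety at an engineered protease site can be pictured with a small string-based sketch. The recognition motifs used here (LVPR/GS for thrombin and IEGR/ for factor Xa, with "/" marking the cut) are the commonly cited consensus sites and are an assumption of this example rather than a statement from this specification; the function name and the toy sequence in the usage note are hypothetical.

```python
from typing import Tuple

# Commonly cited recognition motifs (assumption, not taken from the text).
# Each entry is (motif, offset of the cut measured from the motif start).
CLEAVAGE_SITES = {
    "thrombin": ("LVPRGS", 4),   # cut between Arg and Gly
    "factor_xa": ("IEGR", 4),    # cut immediately after Arg
}


def release_target(fusion: str, protease: str) -> Tuple[str, str]:
    """Split a fusion protein at the first occurrence of the protease motif.

    Returns (N-terminal fragment carrying the tag, C-terminal target).
    """
    motif, offset = CLEAVAGE_SITES[protease]
    start = fusion.find(motif)
    if start == -1:
        raise ValueError(f"no {protease} site found in fusion protein")
    cut = start + offset
    return fusion[:cut], fusion[cut:]
```

With a made-up fusion such as `"MSPILGYWK" + "LVPRGS" + "MKTAYIAK"`, calling `release_target(fusion, "thrombin")` returns the tag fragment ending in `LVPR` and a target fragment that begins with the residual `GS`.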

In an insect system, Autographa californica nuclear
polyhedrosis virus (AcNPV) is used as a vector to express foreign
sequences. The virus grows in Spodoptera frugiperda cells. A NHP
coding sequence can be cloned individually into non-essential
regions (for example the polyhedrin gene) of the virus and placed
under control of an AcNPV promoter (for example the polyhedrin
promoter). Successful insertion of NHP coding sequence will
result in inactivation of the polyhedrin gene and production of
non-occluded recombinant virus (i.e., virus lacking the
proteinaceous coat coded for by the polyhedrin gene). These
recombinant viruses are then used to infect Spodoptera frugiperda
cells in which the inserted sequence is expressed (e.g., see
Smith et al., 1983, J. Virol. 46:584; Smith, U.S. Patent No.
4,215,051).






ligated to an adenovirus transcription/translation control
complex, e.g., the late promoter and tripartite leader sequence.
This chimeric sequence may then be inserted in the adenovirus
genome by in vitro or in vivo recombination. Insertion in a
non-essential region of the viral genome (e.g., region E1 or E3) will
result in a recombinant virus that is viable and capable of
expressing a NHP product in infected hosts (e.g., see Logan &
Shenk, 1984, Proc. Natl. Acad. Sci. USA 81:3655-3659). Specific
initiation signals may also be required for efficient translation
of inserted NHP nucleotide sequences. These signals include the
ATG initiation codon and adjacent sequences. In cases where an
entire NHP gene or cDNA, including its own initiation codon and
adjacent sequences, is inserted into the appropriate expression
vector, no additional translational control signals may be needed.
However, in cases where only a portion of a NHP coding sequence is
inserted, exogenous translational control signals, including,
perhaps, the ATG initiation codon, must be provided. Furthermore,
the initiation codon must be in phase with the reading frame of
the desired coding sequence to ensure translation of the entire
insert. These exogenous translational control signals and
initiation codons can be of a variety of origins, both natural and
synthetic. The efficiency of expression can be enhanced by the
inclusion of appropriate transcription enhancer elements,
transcription terminators, etc. (see Bittner et al., 1987, Methods
in Enzymol. 153:516-544).
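The requirement that an initiation codon be in phase with the reading frame of the insert can be checked mechanically. The sketch below is a minimal illustration under stated assumptions: the construct is given as a DNA string, `coding_start` (a hypothetical parameter) is the 0-based index of the first base of the first codon of the desired coding sequence, and the standard stop codons TAA, TAG, and TGA apply. It is not a method described in this specification.

```python
from typing import Optional

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)


def first_in_phase_atg(construct: str, coding_start: int) -> Optional[int]:
    """Return the index of the first ATG that is in phase with coding_start.

    An ATG qualifies when its distance to coding_start is a multiple of
    three and no stop codon lies between it and the coding sequence.
    Returns None when no such ATG exists.
    """
    construct = construct.upper()
    for pos in range(0, coding_start + 1):
        if construct[pos:pos + 3] != "ATG":
            continue
        if (coding_start - pos) % 3 != 0:
            continue  # out of phase with the desired reading frame
        # Codons read from the candidate ATG up to the coding sequence.
        leader = [construct[i:i + 3] for i in range(pos, coding_start, 3)]
        if any(codon in STOP_CODONS for codon in leader):
            continue  # translation would terminate before the insert
        return pos
    return None
```

An insert that carries its own initiation codon gives `first_in_phase_atg("ATGAAA", 0) == 0`, while an exogenous ATG placed four bases upstream of the coding sequence is rejected because the frame is shifted by one base.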

In addition, a host cell strain may be chosen that modulates
the expression of the inserted sequences, or modifies and
processes the gene product in the specific fashion desired. Such
modifications (e.g., glycosylation) and processing (e.g.,



st systems can be chosen to ensure the correct 
modification and processing of the foreign protein expressed. To 
this end, eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, 
glycosylation, and phosphorylation of the gene product may be
used. Such mammalian host cells include, but are not limited to,
CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and in
particular, human cell lines.

For long-term, high-yield production of recombinant proteins,
stable expression is preferred. For example, cell lines which
stably express the NHP sequences described above can be
engineered. Rather than using expression vectors which contain
viral origins of replication, host cells can be transformed with
DNA controlled by appropriate expression control elements (e.g.,
promoter, enhancer sequences, transcription terminators,
polyadenylation sites, etc.), and a selectable marker. Following
the introduction of the foreign DNA, engineered cells may be
allowed to grow for 1-2 days in an enriched medium, and then are
switched to a selective medium. The selectable marker in the
recombinant plasmid confers resistance to the selection and allows
cells to stably integrate the plasmid into their chromosomes and
grow to form foci which in turn can be cloned and expanded into
cell lines. This method may advantageously be used to engineer
cell lines which express the NHP product. Such engineered cell
lines may be particularly useful in screening and evaluation of
compounds that affect the endogenous activity of the NHP product.

A number of selection systems may be used, including but not
limited to the herpes simplex virus thymidine kinase (Wigler, et
al., 1977, Cell 11:223), hypoxanthine-guanine



me resistance can be used as the basis of selection 
for the following genes: dhfr, which confers resistance to
methotrexate (Wigler, et al., 1980, Proc. Natl. Acad. Sci. USA
77:3567; O'Hare, et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527);
gpt, which confers resistance to mycophenolic acid (Mulligan & Berg,
1981, Proc. Natl. Acad. Sci. USA 78:2072); neo, which confers
resistance to the aminoglycoside G-418 (Colberre-Garapin, et al.,
1981, J. Mol. Biol. 150:1); and hygro, which confers resistance to
hygromycin (Santerre, et al., 1984, Gene 30:147).

Alternatively, any fusion protein can be readily purified by
utilizing an antibody specific for the fusion protein being
expressed. For example, a system described by Janknecht et al.
allows for the ready purification of non-denatured fusion proteins
expressed in human cell lines (Janknecht, et al., 1991, Proc.
Natl. Acad. Sci. USA 88:8972-8976). In this system, the sequence
of interest is subcloned into a vaccinia recombination plasmid
such that the sequence's open reading frame is translationally
fused to an amino-terminal tag consisting of six histidine
residues. Extracts from cells infected with recombinant vaccinia
virus are loaded onto Ni2+ nitrilotriacetic acid-agarose columns and
histidine-tagged proteins are selectively eluted with
imidazole-containing buffers.

Also encompassed by the present invention are fusion proteins
that direct the NHP to a target organ and/or facilitate transport
across the membrane into the cytosol. Conjugation of NHPs to
antibody molecules or their Fab fragments could be used to target
cells bearing a particular epitope. Attaching the appropriate
signal sequence to the NHP would also transport the NHP to the
desired location within the cell. Alternatively, targeting [...]



and 6,110,490 and their respective 
disclosures which are herein incorporated by reference in their 
entirety. Additionally embodied are novel protein constructs 
engineered in such a way that they facilitate transport of the NHP 
to the target site or desired organ, where they cross the cell
membrane and/or the nucleus where the NHP can exert its functional
activity. This goal may be achieved by coupling of the NHP to a
cytokine or other ligand that provides targeting specificity,
and/or to a protein transducing domain (see generally U.S.
applications Ser. No. 60/111,701 and 60/056,713, both of which are
herein incorporated by reference, for examples of such transducing
sequences) to facilitate passage across cellular membranes and can
optionally be engineered to include nuclear localization
sequences.


5.3 ANTIBODIES TO NHP PRODUCTS 
Antibodies that specifically recognize one or more epitopes
of a NHP, or epitopes of conserved variants of a NHP, or peptide
fragments of a NHP are also encompassed by the invention. Such
antibodies include but are not limited to polyclonal antibodies,
monoclonal antibodies (mAbs), humanized or chimeric antibodies,
single chain antibodies, Fab fragments, F(ab')2 fragments,
fragments produced by a Fab expression library, anti-idiotypic
(anti-Id) antibodies, and epitope-binding fragments of any of the
above.

The antibodies of the invention may be used, for example, in
the detection of NHP in a biological sample and may, therefore, be
utilized as part of a diagnostic or prognostic technique whereby
patients may be tested for abnormal amounts of NHP. Such



to, for example, evaluate the normal and/or
engineered NHP-expressing cells prior to their introduction into
the patient. Such antibodies may additionally be used as a method
for the inhibition of abnormal NHP activity. Thus, such
antibodies may be utilized as part of treatment
methods.

For the production of antibodies, various host animals may be
immunized by injection with the NHP, an NHP peptide (e.g., one
corresponding to a functional domain of an NHP), truncated NHP
polypeptides (NHP in which one or more domains have been deleted),
functional equivalents of the NHP or mutated variants of the NHP.
Such host animals may include but are not limited to pigs,
rabbits, mice, goats, and rats, to name but a few. Various
adjuvants may be used to increase the immunological response,
depending on the host species, including but not limited to
Freund's adjuvant (complete and incomplete), mineral salts such as
aluminum hydroxide or aluminum phosphate, surface active
substances such as lysolecithin, pluronic polyols, polyanions,
peptides, oil emulsions, and potentially useful human adjuvants
such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum.
Alternatively, the immune response could be enhanced by
combination and/or coupling with molecules such as keyhole limpet
hemocyanin, tetanus toxoid, diphtheria toxoid, ovalbumin, cholera
toxin or fragments thereof. Polyclonal antibodies are
heterogeneous populations of antibody molecules derived from the
sera of the immunized animals.

Monoclonal antibodies, which are homogeneous populations of 
antibodies to a particular antigen, can be obtained by any 
technique which provides for the production of antibody molecules 



, Proc. Natl. Acad. Sci. USA 80:2026-2030),
and the EBV-hybridoma technique (Cole et al., 1985, Monoclonal
Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).
Such antibodies may be of any immunoglobulin class including IgG,
IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma
producing the mAb of this invention may be cultivated in vitro or
in vivo. Production of high titers of mAbs in vivo makes this the
presently preferred method of production.

In addition, techniques developed for the production of
"chimeric antibodies" (Morrison et al., 1984, Proc. Natl. Acad.
Sci., 81:6851-6855; Neuberger et al., 1984, Nature, 312:604-608;
Takeda et al., 1985, Nature, 314:452-454) by splicing the genes
from a mouse antibody molecule of appropriate antigen specificity
together with genes from a human antibody molecule of appropriate
biological activity can be used. A chimeric antibody is a
molecule in which different portions are derived from different
animal species, such as those having a variable region derived
from a murine mAb and a human immunoglobulin constant region.
Such technologies are described in U.S. Patents Nos. 6,075,131 and
5,877,397 and their respective disclosures which are herein
incorporated by reference in their entirety.

Alternatively, techniques described for the production of
single chain antibodies (U.S. Patent 4,946,778; Bird, 1988,
Science 242:423-426; Huston et al., 1988, Proc. Natl. Acad. Sci.
USA 85:5879-5883; and Ward et al., 1989, Nature 341:544-546) can
be adapted to produce single chain antibodies against NHP sequence
products. Single chain antibodies are formed by linking the heavy
and light chain fragments of the Fv region via an amino acid
bridge, resulting in a single chain polypeptide.



which can be generated by reducing the disulfide
bridges of the F(ab')2 fragments. Alternatively, Fab expression
libraries may be constructed (Huse et al., 1989, Science,
246:1275-1281) to allow rapid and easy identification of
monoclonal Fab fragments with the desired specificity.

Antibodies to a NHP can, in turn, be utilized to generate
anti-idiotype antibodies that "mimic" a given NHP, using
techniques well known to those skilled in the art (see, e.g.,
Greenspan & Bona, 1993, FASEB J. 7(5):437-444; and Nisonoff, 1991,
J. Immunol. 147(8):2429-2438). For example, antibodies which bind
to a NHP domain and competitively inhibit the binding of NHP to
its cognate receptor can be used to generate anti-idiotypes that
"mimic" the NHP and, therefore, bind and activate or neutralize a
receptor. Such anti-idiotypic antibodies or Fab fragments of such
anti-idiotypes can be used in therapeutic regimens involving a NHP
signaling pathway.

The present invention is not to be limited in scope by the 
specific embodiments described herein, which are intended as 
single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the
scope of the invention. Indeed, various modifications of the
invention, in addition to those shown and described herein will
become apparent to those skilled in the art from the foregoing
description. Such modifications are intended to fall within the
scope of the appended claims. All cited publications, patents,
and patent applications are herein incorporated by reference in
their entirety.



